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HIGH LEVEL OF EXPRESSION OF INGAP 
IN BACTERIAL AND EURARYOTIC CELLS 

This is a continuation-in-part application of U.S. Ser No 
08/741,096, filed Oct. 30, 1996, now abandoned. 5 

TECHNICAL FIELD OF THE INVENTION 

This invention is related to methods and constructs for 
achieving high level expression of INGAP, a protein 
involved in islet cell neogenesis. 10 

BACKGROUND OF THE INVENTION 

Pancreatic islets of Langerhans are the only organs of 
insulin production by p cells in the body. However, they 15 
have a limited capacity for regeneration. This limited regen- 
eration capacity predisposes mammals to develop diabetes 
mellitus. Thus there is a need in the art of endocrinology for 
products which can stimulate the regeneration of islets of 
Langerhans to prevent or ameliorate the symptoms of dia- 20 
betes mellitus. 

There are many factors regulating pancreatic (3 cell mass. 
(Vinik, et al., Diabetes Reviews 4: 235-263, 1996.) A 
pancreatic extract called ilotropin induces p cell regenera- 
tion and reverses diabetes. (Rosenberg et al. (1996) Diabe- 25 
tologia 39: 256-262. A gene encoding a protein within 
ilotropin has been identified and isolated; the protein is 
responsible for stimulating islet cell regeneration. 
(Rafaeloff, R. Journal of Clinical Investigations 99: 
2100-2109, 1997.) This protein is called INGAP, and is 30 
disclosed in patent applications Ser. Nos. 08/401,530, 
08/709,662, and 60/006,271. The disclosure of these appli- 
cations is expressly incorporated herein. Despite the knowl- 
edge of the complete nucleotide sequence of the INGAP 
gene, expression of the protein has been limited. Thus there 35 
is a need in the art for methods of expressing and isolating 
large quantities of the INGAP protein, especially in eukary- 
otic systems. 

SUMMARY OF THE INVENTION 40 

It is an object of the present invention to provide a method 
of producing biologically active INGAP protein from a 
recombinant host cell. 

It is another object of the present invention to provide a 45 
host cell which expresses large amounts of INGAP protein. 

It is an object of the present invention to provide a 
recombinant construct for expression of biologically active 
INGAP protein. 

Another object of the invention is to provide a method for 50 
isolating INGAP protein from a recombinant host cell. 

These and other objects of the invention are achieved by 
providing the art with a recombinant construct for expres- 
sion of biologically active INGAP protein comprising: 

a first nucleotide sequence encoding amino acids 27 to 
175 SEQ ID NO: 6 operably linked to a transcriptional 
initiation site and a translation^ initiation site, wherein a 
second nucleotide sequence encoding a signal peptide is not 
present immediately 5 1 of said first nucleotide sequence. ^ 

In another embodiment of the invention a method of 
producing INGAP activity from a recombinant host cell is 
provided. The method comprises the steps of: 

culturing a host cell comprising a recombinant construct 
comprising a first nucleotide sequence encoding amino acids 65 
27 to 175 SEQ ID NO: 6 operably linked to a transcriptional 
initiation site and a translational initiation site, wherein a 



2 

second nucleotide sequence encoding a signa! peptide is not 
present immediately 5 f of said first nucleotide sequence; 
recovering protein from said cultured host celL 
In yet another embodiment of the invention a host cell is 
provided. The host cell comprises a recombinant construct 
comprising a first nucleotide sequence encoding amino acids 
27 to 175 SEQ ID NO: 6 operably linked to a transactional 
initiation site and a translational initiation site, wherein a 
iq second nucleotide sequence encoding a signal peptide is not 
present immediately 5' of said first nucleotide sequence. 
These and other embodiments of the invention which will be 
apparent to those of skill in the art provide a practical source 
of INGAP protein in amounts suitable for use in preclinical 
and clinical situations, 

15 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. L SDS-PAGE gel of products of bacterial transfec- 
tion. Bacterial lysate without transfection (CBL), bacterial 
20 lysate with transfection (TBL), fractions from Ni-NTA chro- 
matography (eluted at pH63 (6.3); pH 5.9 (5.9); and pH 4.5 
(45) and standards (Std). 

FIG. 2. ECL film of Western blot using INGAP antibody 
945-2. Lanes are as identified in the description to FIG. 1. 

25 

DETAILED DESCRIPTION 

It is a discovery of the present inventors that bacterial 
expression as well as eukaryotic expression of INGAP can 

30 be achieved at high levels by deleting the coding sequence 
of the signal sequence of INGAP. While not wanting to be 
bound by any particular theory or mechanism of action, 
applicants believe that the signal sequence is tone to host 
systems. The signal sequence comprises amino acids 1 to 26 

35 as shown in SEQ ID NO: 5. In the constructions tested, the 
5 1 untranslated region comprising nucleotides 1-16 SEQ ID 
NO. 1 has also been deleted. This deletion may also con- 
tribute to the increase in expression which has been 
observed. 

40 Applicants have found that an inducible transcription 
initiator is exceedingly useful for INGAP expression in 
prokaryotic systems. Suitable inducible transcription initia- 
tors include the lac promoter/operator, the tac promoter, the 
trp promoter, the kcl promoter, the tet promoter, as well as 

45 others which are known in the art. 

According to another aspect of the invention, a histidine 
tag can be put on the protein. The histidine tag can simplify 
processing and purification. A histidine tag is a stretch of 
histidine residues which is appended to a protein, usually by 

50 genetic engineering. Preferably the tag comprises between 3 
and 12 histidine residues. They may be contiguous or 
interrupted by other residues. The histidine tag may be 
appended to the N-terminal or to the C-terminal end of the 
protein to minimize disruption of protein function. Methods 

55 for making and utilizing histidine tags are known in the art. 
The oligohistidine can be used as an affinity moiety using a 
metal chelate, such as nickel-NTA (N-(5-amino-l- 
carboxypentyl)-iminodiacetic acid) as the other affinity part- 
ner. 

60 A recombinant construct according to the invention, is 
any DNA molecule which has been engineered so that two 
segments of DNA are adjacent to each other which are not 
adjacent to each other in nature. Preferably such engineering 
is performed in vitro, although in vivo engineering can also 

65 be performed. The construct may be a plasmid, phage, virus, 
transposable element, minichromosome, or other element, as 
is suitable for the desired application. 
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In a preferred embodiment, the construct for INGAP 
expression in eukaryotic systems contains an origin of 
replication, e.g., EBV oriP, which permits extrachromo- 
somal maintenance in primate and canine cell lines. A 
sequence element which encodes a nuclear antigen and 5 
allows high level episomal replication, e.g., EBNA-1 or 
SV40 T antigen is also desirable. 

The coding sequence of amino acid residues 27-175 of 
INGAP protein are included in the constructs. Preferably the 
entire signal sequence is deleted. However, it is possible that 10 
only a portion of the signal sequence must be deleted to 
obtain excellent expression. Thus some portion of the signal 
sequence might be retained in the constructs. 

Deletion of the 5' untranslated region, nucleotides 1-16, 
is also desirable. However, it is not known if this is neces- 15 
sary to achieve excellent expression. Thus the 5' untranslated 
region may be retained in some constructs without departing 
from the spirit of the invention. 

A host cell according to the invention can be transfected 2Q 
or transformed with a recombinant construct according to 
the present invention. The host may be a bacteria, yeast, 
insect, or mammalian cell. For eukaryotic expression of 
INGAP, any cell lines suitable for protein expression may be 
used, including COS-7 cells and CHO cells. ^ 

Selection of suitable promoters and translational initiators 
for use in the appropriate host cell is well within the ability 



of those skilled in the art. For eukaryotic expression system, 41 
it is exceedingly useful to choose a promoter sequence 
which is capable of initiating constitutive transcription to 
achieve constitutive high level expression of the protein. 
Rous sarcoma virus long terminal repeat (RSVLTR) is an 
example of such promoter, although others as are known in 4f 
the art can be used. 

Host cells may be transformed, transfected, mated or 
infected with the recombinant host cell of the present 
invention. Culturing of host cells can be performed using 5C 
techniques and media which are well known in the art. 
Again, a suitable medium and technique can be selected by 
an ordinary skilled artisan. 

The above disclosure generally describes the present 
invention. A more complete understanding can be obtained 55 
by reference to the following specific examples which are 
provided herein for purposes of illustration only, and are not 
intended to limit the scope of the invention. 

EXAMPLE 1 

This example describes the experimental design 
employed. 

We generated a new INGAP cDNA by PCR which 
excluded the 5' UTR region (nucleotides 1-16 in SEQ ID: 1) 
and nucleotides encoding the signal peptide (nucleotides 65 
17-94 SEQ ID NO: 1) and created two new restriction 
enzyme recognition sites enabling the insertion of the new 
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construct into a new pQE-31 expression vector. This new 
hgated construct was transformed into TOPI OP competent 
cells (E. coli host strain from Invitrogen). The positive 
clones were identified, verified by restriction enzyme diges- 

5 tion and the DNA isolated. The DNAwas now transformed 
into a different competent E. coli strain, M15(pREP4) and 
expression of the protein was induced by IPTG (isopropyl- 
beta, D-thiogalactopyranoside) which inhibits the repressor, 
facilitating expression of the protein from the M15 

10 promoter/operator. The reason for the intermediate transfor- 
mation of the ligated material into TOP10P is that these 
cells are highly competent increasing the odds of getting 
insert positive colonies. The M15(pREP4) cells that were 
used for protein expression do not attain competency levels 

15 high enough to guarantee transformation of the ligation 
products. The resultant plasmid DNA obtained from the 
transformation of the TOP10P was sufficient to enhance the 
transformation of the M15(pREP4) cells. The His-tagged 
protein was isolated by Ni +2 agarose affinity purification. 

20 We used a PCR approach to generate a new INGAPcDNA 
which excludes the 5 1 UTR region (nucleotides 1-16 in SEQ 
ID NO: 1) and nucleotides encoding the signal peptide. 

The nucleotide sequence (SEQ ID NO: 1) and corre- 
25 sponding amino acid sequence (SEQ ID NO: 5) that have 
been excluded are as follows: (the bolded area represents the 
sequence of the signal peptide) 

CTGCAAGACAGGTACC ATG ATG CTT CCC ATG ACC CTC TGT AGG 
MET MET Leu Pro MET The Leu Cys Arg 
ATG TCT TGG ATG CTG CTT TCC TGC CTG ATG TTC CTT TCT TGG 
MET Ser Trp MET Leu Leu Ser Cys Leu MET Phe Leu Ser Trp 
GTG GAA GGT 
Val Glu Gly 

W EXAMPLE 2 

This example describes the use of polymerase chain 
reaction to synthesize INGMAT (a construct which lacks the 
45 signal peptide sequence, i.e., which encodes the mature 
protein). 

Oligonucleotide design: 

50 Oligonucleotides for PCR were designed to incorporate 
restriction enzyme recognition sites at their respective 5* 
ends. The oligonucleotide designed for the 5' end of the gene 
incorporates a Bam HI site followed by 20 nucleotides 
corresponding to the N-terminus of the mature protein. The 

55 oligonucleotide designed for the 3 1 end incorporates an Xho 
I site followed by 20 untranslated nucleotides. The PCR 
product generated from these primers contains the mature 
INGAP sequence and the native protein termination codon. 

60 The following is the sequence of the oligonucleotides 
used: 

5' of INGAP (SEQ ID NO: 2) 

S'-CCGCGGArCCCGAAGAATCTCAAAAGAAACT-^ 
65 3'ofINGAP(SEQIDNO:3) 

5 1 -GACCGGCTCGAGTGCTCrTCCTGAGTGAArCC-3' 
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PCR of INGMAT 
Reaction conditions 



Template: (50 ng INGAF original cDNA 5 fd 
removed from pCDNA3) 

MgCl 2 : 4 jA 

10 X PCR buffer 5 fd 

dATP 1 fd 

dCTP 1 fd 

dGTP 1 fd 

dTTP i /a 

5' primer 1 fd 

y primer 1 fd 

H 2 0 29 /d 

Taq polymerase 1 fd 

total volume - 50 fd 
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cycle parameters 

A) 2 min at 95° C. 

B) 30 cycles of (1 min 95° C, 1 min 55° C, 1 min 72° 

co 

Q 7min at 72° C. 

D) 4° C. until removed from thermal cycler. 

The PCR products were then electrophoresed on a 5% 25 
PAGE in TBE. Ethidium bromide stained PCR products 
corresponding to the expected size for the construct were cut 
from the gel. The gel fragments were electro-eluted into 0.5 
ml of TBE, precipitated with 50 3M sodium acetate and ^ 
1 ml of isopropanol at -80° C. for 20 min, centrifuged, 
washed once with 1 ml of isopropanol, washed once with 1 
ml of 70% ethanol, and then dried under vacuum. The dried 
pellet was resuspended in 50 ju\ H 2 0 and quantified. At the 
end of this step the sequence of the PCR product that 35 
contains both restriction sites minus the signal sequence and 
5' UTR was as follows (SEQ ID NO: 4): 



5-CC 


GCG 


GAT 


CCC 


GAA 


GAA 


TCT 


CAA 


AAG 


AAA 


CTGCCT 




TCT 


TCA 


CGT 


ATA 


ACC 


TGT 


CGT 


CAA 


GGC 


TCT 


GTA 


GCC 


TAT 


GGG 


TCC 


TAT 


TGC 


TAT 


TCA 


CTG 


ATT 


TTG 


ATA 


CCA 


CAG 


ACC 


TGG 


TCT 


AAT 


GCA 


GAA 


CTA 


TCC 


TGC 


CAG 


ATG 


CAT 


TTC 


TCA 


GGA 


CAC 


CTG 


GCA 


TTT 


CTT 


CTC 


AGT 


ACT 


GGT 


GAA 


ATT 


ACC 


TTC 


GTG 


TCC 


TCC 


CTT 


GTG 


AAG 


AAC 


AGT 


TTG 


ACG 


GCC 


TAC 


CAO 


TAC 


ATC 


TGG 


ATT 


GGA 


CTC 


CAT 


GAT 


CCC 


TCA 


CAT 


GGT 


ACA 


CTA 


CCC 


AAC 


GGA 


AGT 


GGA 


TGG 


AGG 


TGG 


AGC 


AGT 




TCC 


AAT 


GTG 


CTG 


ACC 


TTC 


TAT 


AAC 


TGG 


GAG 


AGG 


AAC 


CCC 


TCT 


ATT 


OCT 


GOT 


GAC 


CGT 


GGT 


TAT 


TGT 


GCA 


GTT 


TTG 


TCT 


CAO 


AAA 


TCA 


GGT 


TTT 


CAG 


AAG 


TGG 


AGA 


GAT 


TTT 


AAT 


TGT 


GAA 


AAT 


GAG 


CTT 


CCC 


TAT 


ATC 


TGC 


AAA 


TTC 


AAG 


GTC 


TAG 


GGC 


AGT 


TCT 


AAT 


TTC 


AAC 


AGC 


TTG 


AAA 


ATA 


TTA 


TGA 


AGC 


TCA 


CAT 


GGA 


CAA 


GGA 


AGC 


AAG 


TAT 


GAG 


GAT 


TCA 


CTC 




AGG 


AAG 


AGC 


ACT 


CGA 


GCC 


GGT 


C-3' 













4The bolded areas represent the primers. 
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EXAMPLE 3 



This example describes the creation of a plasmid contain- 
ing the expression construct. 

5 

Restriction enzyme digestion of the INGMAT PCR 
product and the pQE-31 vector 

We performed two parallel restriction enzyme digestion 

10 reactions using 2.5 /*g of both the INGMAT PCR product 
and pQE-31 vector. INGMAT was digested with Bam HI 
and Xho I simultaneously in a 30 ^1 volume. PQE-31 was 
digested with Bam HI and Sal I simultaneously in a 30 fd 
volume. Both digestion reactions were carried out at 37° C 

15 for a period of 4 hours. After the reactions were completed, 
400 ng of each was electrophoresed on a 15% agarose gel 
and stained with ethidium bromide to assure complete 
digestion. The remainder (~2.1 ug) of both digestion reac- 
tions were passed over a sepharose G-50 to remove the small 

20 DNA fragments followed by two equal volume phenol 
extractions. The extracted DNA was then precipitated with 
2 volumes of ethanol and Vio volume 3M sodium acetate at 
-80° C. for 20 minutes, centrifuged, washed twice with 70% 
ethanol and dried under vacuum. The pellets were resus- 

25 pended in 25 jil H 2 0 and quantified. 

The pQE-31 expression system was purchased from 
QIAGEN Inc. Chatsworth, Calif. 

30 Ligation of INGMAT into pQE-31 

INGMAT (Bam Hl/Xho I) and pQE-31(Bam Hi/Sal I) 
have compatible ends suitable for ligation. As a result of the 
ligation the Sal I restriction site in the vector will be 
35 eliminated. 

Ligation conditions using a 2:1 vector to insert molar 
ratio. 
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pQE31 (vector) 517 ng 9 fid 

LNGMAT (insert) 165 ng 2.5 fid 

10 X ligation buffer 5 fid 

10 mM rATP 5 fid 

T4 Ligase 4u 1 fid 

H 2 0 27.5 fid 

final volume - 50 fid 
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The ligation reactions were incubated at 4° C for 16 
hours. 

Transformation of the ligation reaction products 
into TOP10F competent E. coli 

We removed 5 fd of the ligation reaction into 100 [A of 
competent TOP10P cells, (TOP10P cells were purchased 
from Invitrogen, San Diego, Calif.) with 0.5 fA of 500 mM 
p-mercaptoethanol and incubated on ice for 30 minutes, heat 
shocked for 45 seconds at 42° C, and recovered on ice for 
2 minutes. Then we added 1 ml of prewarmed S° C media 
and incubated at 37° C with shaking at 225 rpm for 1 hour 
followed by plating all the transformation reaction on LB 
broth agar plates containing 100 jUg/ml ampicillin. 

Selection of transformants 

Colony containing plates were lifted onto Nytran mem- 
branes. The colonies were lysed with 0.5M NaOH, 
neutralized, and the resultant DNA bound to the membrane 
by baking at 80° C. for 1 hour. The membranes were then 
hybridized in 50% formaldehyde, 5xSSPE at 50° C. for 16 
hours with 3xl0 d cpm/ml of 32 P random primed INGAP 
cDNA. The membranes were washed at high stringency and 
exposed to X-ray film. Positive colonies were matched up to 
the X-ray film and grown up in 3 mis of LB with ampicillin. 

DNA isolation from positive transformants 

DNA was isolated from the small cultures using alkaline 
lysis, phenol extracted, precipitated, dried, and resuspended 
in 50 jWl H 2 0. A small aliquot of each of the isolated DNA 
were digested with Bam HI and Hind III to release inserts. 
The digested DNAs were electrophoresed on 1.5% agarose 
and stained with ethidium bromide and positive inserts 
identified at approximately 510 bp size range. We took four 
of the insert containing plasmids and incubated them in the 
presence of RNAse to remove any residual bacterial RNA. 

Transformation of the ligation products into MIS 

(pREP4) competent & coli so 

We removed 5 fA of the cleaned DNA isolated in section 
HE and transfer it into 100 §A of M15(pREP4) competent 
cells. The mixture was incubated on ice for 30 minutes, heat 
shocked for 45 seconds at 42° C, and recovered on ice for 55 
2 minutes. 1 ml of prewarmed SOC media was added and 
incubated at 37° C. with shaking at 225 rpm for 90 minutes. 
All of the transformation reaction was plated on LB broth 
agar plates containing 100 /ig/ml ampicillin and 25 #g/ml 
kanamycin. 60 

Selection of transformants for I NG MATH IS 
(INGMAT plus a six-histidine tag) protein 
production 

Eight colonies were picked and grown up in LB with 65 
ampicillin. DNA was isolated from the small cultures using 
alkaline lysis extraction procedures, phenol extracted, 



8 

precipitated, dried, and resuspended in 50 fA H 2 0. A small 
aliquot of each of the isolated DNA were digested with Bam 
HI and Hind III to release inserts. The digested DNA was run 
on 1.5% agarose gel and visualized by staining with 
5 ethidium bromide. Several of the transformants which dem- 
onstrated the plasmid with inserts of the correct size as well 
as the presence of the pREP4 plasmid were stored in 50% 
glycerol at -80° C. to be used for protein production. 

10 EXAMPLE 4 

This example describes denaturing metal affinity protein 
chromatography isolation of the his tagged INGAP protein 
without signal peptide. (Procedure for a 250 ml pING- 
MATHIS transformed M15 (pREP4) culture. pINGMATHIS 
15 is the INGMATHIS construct ligated into the pQE-31 
vector.) 

Bacteria growth and protein induction 

We grew a 25 ml overnight in LB with 100 g/ml ampi- 
20 cillin and 25 /*g/ml kanamycin antibiotic. We started a 250 
ml LB plus 100 jWg/ml ampicillin and 25 /*g/ml kanamycin 
culture with 5 ml of the overnight. (1:50) Grown until 
ABS600-0.0.75 to 0.9 (actual OD-0.866). Added 5 ml of 
100 mM IPTG (2 mM final) to induce production of the 
25 protein. Continue growing for 4 hours in the case of INGAP. 
Collected the bacteria and spin at 6000 rpm for 20 minutes, 
discarded the supernatant. The pellet was frozen until ready 
to use at -70° C. 

30 Ni +2 NTA agarose preparation 

Prepare as much as will be needed. (Use 10 ml of the 50% 
Ni +2 NTA for each 250 ml derived bacterial pellet). Place 16 
ml of the 50% slurry into a disposable 50 ml centrifuge tube. 
Centrifuge for 2 minutes at 800xg and discard the superna- 

35 tant. Add 42 ml of sterile water, resuspend the resin. Cen- 
trifuge for 2 minutes at 800xG and discard the supernatant. 
Add 42 ml of sterile water, resuspend the resin. Centrifuge 
for 2 minutes at 800xG and discard the supernatant. Add 42 
ml of binding/lysis buffer A (6M Guanidine HC1, 0.1M 

40 sodium phosphate, 0.01M Tris, pH 8.0) and resuspend the 
resin. Centrifuge for 2 minutes at 800xG and discard the 
supernatant Add 42 ml of binding/lysis buffer A (6M 
Guanidine HCL, 0.1M sodium phosphate, 0 01M Tris, pH 
8.0) and resuspend the resin. Centrifuge for 2 minutes at 

45 800xG and discard the supernatant. Add 42 ml of binding/ 
lysis buffer A(6M Guanidine HCL, 0.1M sodium phosphate, 
0.01M Tris, pH 8.0) and resuspend the resin. Centrifuge for 
2 minutes at 800xG and discard the supernatant. Bring the 
total volume up to 10 ml with buffer A. The shury is now 

50 ready for the application of the lysed bacteria. 

Bacteria lysis and protein isolation 

Thaw the bacterial pellet for 15 minutes at room tem- 
perature. Resuspend the pellet in 12^ ml of lysis buffer A. 

55 (6M Guanidine HCL, 0.1M sodium phosphate, 0.01M Tris, 
pH 8.0). Transfer the resuspension to a 50 ml centrifuge 
tube. Freeze the resuspension/lysate at -70 until solid. Thaw 
at room temperature. Place the lysate on a rotator for 60 
minutes at room temperature. Centrifuge the lysate for 15 

60 minutes at 10,000xG. Collect the supernatant and add the 10 
ml of prepared Ni2+NTA. Rotate for 45 minutes. Load the 
slurry onto a 1.6 cm diameter column and allow to flow 
through by gravity. 

Washes 

65 

Flow through 50 ml of buffer A. (No need to collect.) 
Flow through 40 ml of buffer B (8M Urea, 0.1M Sodium 
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phosphate, 0.01M Tris, pH 8.0). (No need to collect.) but 
should be at or near zero before continuing, if not, then 
wash with more. Wash through 40 ml of buffer C, same as 
B but pH 6.3. Collect 3 ml fractions. Wash through 40 ml of 
buffer D, same as B but pH 5.9. Collect 3 ml fractions. 5 

Wash through 40 ml of buffer E, same as B but pH 4.5. 
Collect 3 ml fractions. At this point the protein should be in 
one of the fractions taken. Read the absorbance at 280 of all 
the fractions to discern where the protein is. Pool, reduce, 10 
and SDS page electrophoresis as necessary. 

Dialysis 

In order to purify the expressed protein, we changed the 
carrier solution of the fraction extracted from the nickel/ 
NTA at pH 4.5 to Tris buffer using dialysis. Dialysis tubing 
with a molecular weight cut-off of 3000 was prepared by 
boiling in 5 mM EDTA/200 mM sodium bicarbonate for 5 
minutes. The tubing was rinsed briefly in deionized water 
and boiled another 5 minutes in the bicarbonate solution. 
The tubing was returned to deionized water, covered with 
aluminum foil and autoclaved for 10 minutes on a liquid 
cycle. The tubing was handled with latex gloves during the 
entire procedure. 

One ml of the protein solution from the nickel/NTA 
column in 6M guanidine HC1 was dialyzed against 4 liters 
of 25 mM Tris buffer at pH 8.5 for 12 hours. After dialysis, 
there were 2 mis of protein solution with a protein concen- 
tration of 800 ug/ml. 

EXAMPLE 5 

This example describes analytical techniques confirming 
the identity of the product 

SDS-PAGE 

in order to test for the overexpression of the INGAP ^ 
protein, discontinuous denaturing polyacrylamide gel elec- 
trophoresis was performed on the dialyzed protein solution 
using the Hoefer SE250 Mighty Small II apparatus. The 
separating gel was prepared with 15% acrylamide, 1.35% 
bis-acrylamide in 375 mM Tris buffer at pH 8.8 with 0.05% 45 
sodium dodecyl sulfate. Polymerization was induced by 
addition of 0.05% ammonium persulfate and 20 /il TEMED/ 
15 ml solution. The solution was placed in the gel plate 
apparatus for polymerization. The stacking gel was poured 
with the same solution, except the Tris buffer was 125 mM 5Q 
at pH 6.8, and the acrylamide concentration was 4%. The 
protein samples were diluted 1:1 with sample buffer (125 
mM Tris-O, pH 6.8, 4% SDS, 20% glycerol, and 10% 
2-mercaptoethanol). 

The upper and lower tank buffers were identical, contain- 55 
ing 25 mM Tris, 192 mM glycine and 0.1% SDS at pH 8.3. 
Two gels were loaded with 20 fA each of bacterial lysate 
without transfection (CBU368 ug/ml), bacterial lysate with 
transfection (TBL, 341 ug/ml), the fractions from Ni-NTA 
chromatography (eluted at pH6.3, 110 ug/ml; pH 5.9, 100 60 
ug/ml; and pH 4.5, 800 ug/ml) and standards (Rainbow 
Markers, Amersham and Dalton Mark-VII, Sigma). Elec- 
trophoresis was performed at 20 mA constant current until 
the dye front entered the separating gel, and at 60 mA 
constant current until the dye front reached 0.5 cm from the 65 
bottom. The gels were then removed and one was fixed with 
45% methanol/10% acetic acid for one hour, and the other 
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25 
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was placed in transfer buffer (25 mM Tris, 192 mM glycine, 
20% methanol, pH 8.3) for 20-30 minutes, 

^ Silver Staining 

The fixed gel was equilibrated with 2 changes of 10% 
ethanol/5% acetic acid for 30 minutes each. The gel was then 
exposed to a 0.0032N HNO.j/K2Cr207 solution for 5 min- 
iQ utes. The gel was washed in deionized water 3 times for 10 
minutes each. The gel was impregnated with silver using 0.1 
g AgN(y50 ml H 2 0 for 30 minutes. The silver solution was 
washed off the gel in deionized water for 5 minutes. The gel 
was then exposed to a developer solution (29.7 g anhydrous 
15 Na2C0 3 in 1 liter K^O with 0.5 ml formalin) in 5 minute 
intervals between changes until the desired density was 
reached. The development was stopped with 10% acetic 
acid, and the gel stored in H 2 0. 

The gel showed a protein band of approximately 19 kD 
that was prominent in the bacterial lysate from transfected 
cells and in the elution fraction from pH 4.5 on nickel/NTA 
(FIG. 1). This protein was not represented in any of the other 
samples. This is consistent with the size of INGAP protein 
and with interaction of the inserted histidine tagging region 
with the nickel/NTA column matrix. 

Western Blotting 

30 Immobilon-P PVDF membrane was wetted with 100% 
methanol, and equilibrated with transfer buffer for 10 min- 
utes. The gel was removed from transfer buffer and placed 
on the PVDF membrane. All bubbles between the membrane 
and the gel were removed. The combination was placed 

35 between Whatmann 3 mm filter paper wetted with transfer 
buffer and the whole "sandwich" was placed in the cassette 
of a Hoefer transfer tank. The cassette was placed in the 
transfer tank filled with transfer buffer with the gel toward 
the cathode. The transfer was performed at 12V constant 

40 voltage for 18 hours. 

After transfer, the membrane was placed in a blocking 
buffer of 0.5M Tris, 2M NaCl and 1% polyethylene glycol 
with 5% bovine serum albumin and 10% goat serum at room 

45 temperature for 1 hour. The membrane was then placed into 
20 ml of blocking buffer containing INGAP antibody 945-2 
at a dilution of 1:5000 and incubated at room temperature for 
1 hour. The membrane was then washed 3 times for 15 
minutes each with 50 ml of washing buffer (0.4% Tween-20 

^ in phosphate-buffered saline (PBS) at pH 7.4). The mem- 
brane was then incubated for 1 hour at room temperature in 
washing buffer containing anti-rabbit IgG (whole molecule, 
Sigma Cat # A-0545) peroxidase conjugate at a 1:160,000 
dilution. The membrane was washed 3 times for 5 minutes 

55 in 50 ml of 0.2% Tween-20 in PBS, followed by 3 washes 
of 5 minutes each with 0.1% Tween-20 in PBS. The blot was 
revealed using the enzyme chemiluminescence kit from 
Amersham Corp., Arlington, DL according to instructions. 
The ECL blot was exposed to Kodak X-Omat AR-5 X-ray 
film for 20 minutes. 

60 

ECL of the blot revealed strong protein recognition of the 
overexpressed 19 kD proteins in the whole lysate from 
transfected bacteria (IBL) and the pH 45 fraction that were 
visualized on the SDS-PAGE gels (FIG. 2). In addition, there 
65 was a protein band recognized in both bacterial lysates at 40 
kD, implying that this protein is weakly recognized and is a 
bacterial protein rather than a product of the traosfection 
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Finally, there was a light band at 14 kD recognized by the 
antibody in both the transfected bacterial lysate and in the 
pH 4.5 fraction. This may either be another protein or a lytic 
fraction of the INGAP protein. Given the engineering done 
to produce the INGAP protein it is most likely a lytic 5 
fraction of INGAP. 

In summary, we have been able to express INGAP protein 
in a prokaryotic system by excluding the $ UTR and the 
signal peptide and insertion of the new construct into a new 
vector. The resultant protein is of the predicted molecular 10 
size of INGAP monomer and reacts with the antibody to 
INGAP in a Western analysis. The protein shares with 
INGAP peptide the ability to induce ductal cell proliferation. 

EXAMPLE 6 15 

This example describes the experimental design 
employed for INGAP expression in eukaryotic systems. 

We generated an INGAP cDNAby PCR which excluded 
the 5' UTR region (nucleotides 1-16 in SEQ ID: 1) and 20 
nucleotides encoding the signal peptide (nucleotides 17-94 
SEQ ID NO: 1). The reason for excluding the 5* UTR region 
was to create a protein that is similar to the native protein in 
which the 5' UTR is not part of the protein. We also created 
two new restriction enzyme recognition sites enabling the 25 
insertion of the new construct into a new pEBVHis-B 
eukaryotic expression vector. This new ligated construct was 
transformed into INVaF competent cells (E. coli host strain 
from Invitrogen). The positive clones were identified, veri- 
fied by restriction enzyme digestion and the DNA isolated 30 
and transfected into COS-7 cells. The His-tagged protein 
was isolated by Ni +2 agarose affinity purification. The iso- 
lated protein showed biological activity when used to stimu- 
late proliferation of ARIP (ductal) cells as measured by 
3 H-TdR incorporation. 35 

We used a PCR approach to generate a new INGAP cDNA 
which excludes the S UTR region (nucleotides 1-16 in SEQ 
ID NO: 1) and nucleotides encoding the signal peptide. 

The sequence (SEQ ID NO: 1) that has been excluded is 
as follows: (the bolded area represents the sequence of the 40 
signal peptide) 

CTGCAAGACAGGTACCATG ATG CTT CCC ATG ACC CTC TGT 
MET MET Lea Pro MET The Leu Cys 
AGG ATG TCT TGG ATG CTG CTT TCC TGC CTG ATG TTC 
Arg MET Ser Trp MET Leu Leo Ser Cys Leu MET Phe 
CTT TCT TGG GTG GAA GGT 
Leu Ser Trp Val Gto Gly 



To engineer the new INGAP construct we designed oligo- 55 
nucleotides corresponding to the 5* and 3' ends of the INGAP 
sequence to be amplified. 

EXAMPLE 7 

This example describes the use of polymerase chain 60 
reaction to synthesize INGMAT (construct which lacks the 
signal peptide sequence, i.e., which encodes the mature 
protein). 

Oligonucleotide design 65 
Oligonucleotides for PCR were designed to incorporate 
restriction enzyme recognition sites at their respective 5' 
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ends. The oligonucleotide designed for the 5' end of the gene 
incorporates a Bam HI site followed by 20 nucleotides 
corresponding to the N-terminus of the mature protein. The 
oligonucleotide designed for the 3' end incorporates an Xho 
5 I site followed by 20 untranslated nucleotides. The PCR 
product generated from these primers contains the mature 
INGAP sequence and the native protein termination codon. 

The following is the sequence of the oligonucleotides 
used: 

10 

5' of INGAP (SEQ ID NO: 2 ) 

S'-CCGCGGArCCCGAAGAArCTCAAAAGAAACT^' 
3' of INGAP (SEQ ID NO: 3) 

5'- GACCGGCTCGAGTGCTCTTCCTGAGTGAATCC 



PCR of INGMAT 
Reaction conditions 

20 



Template: (50 ag INGAP original cDNA 5 /A 
removed from pCDNA3) 

MgQ 2 : 4/d 

10 X PCR buffer 5 jA 

25 dATP 1 fi\ 

dCTP IfA 

dGTP 1 (A 

dTTP IfA 

5' primer 1 /A 

3' primer 1 }A 

30 H 2 0 29 fil 

Taq polymerase 1 /A 



total volume = 50 /A 



35 cycle parameters 

A) 2 min at 95° C. 

B) 30 cycles of (1 min 95° C, 1 min 55° C, 1 min 72° 
C.) 

40 C) 7 min at 72° C. 

D) 4° C. until removed from thermal cycler. 



55 The PCR products were then electrophoresed on a 5% 
PAGE in TBE. Ethidium bromide stained PCR products 
corresponding to the expected size for the construct were cut 
from the gel. The gel fragments were electro-eluted into 0.5 
ml of TBE, precipitated with 50 /4 3M sodium acetate and 
1 ml of isopropanol at -80° C. for 20 min, centrifuged, 
washed once with 1 ml of isopropanol, washed once with 1 
ml of 70% ethanol, and then dried under vacuum. The dried 
pellet was resuspended in 50 /d H 2 0 and quantified. At the 

65 end of this step the sequence of the PCR product that 
contains both restriction sites minus the signal sequence and 
5' UTR was as follows (SEQ ID NO: 4): 
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5-CC 


GCG 


GAT 


CCC 


GAA 


GAA 


TCT 


CAA 


AAG 


AAA 


CTGCCT 




TCI 


TCA 


CGT 


ATA 


ACC 


TGT 


CCT 


CAA 


GGC 


TCT 


GTA 


GCC 


TAT 


CXXj 


TCC 


TAT 


TGC 


TAT 


TCA 


CTG 


ATT 


TTG 


ATA 


CCA 


CAG 


ACC 


TGG 


TCT 


AAT 


GCA 


GAA 


CTA 


TCC 


TGC 


CAG 


ATG 


CAT 


TTC 


TCA 


GOA 


CAC 


CTG 


GCA 


TTT 


CTT 


CTC 


AGT 


ACT 


GGT 


GAA 


ATT 


ACC 


TTC 


GTG 


TCC 


TCC 


CTT 


GTG 


AAG 


AAC 


AGT 


TTG 


ACG 


GCC 


TAC 


CAG 


Tip 


ATP 

AIL 


TGO 


ATT 


GGA 


CTC 


CAT 


GAT 


CCC 


TCA 


CAT 


GGT 


ACA 


CTA 


CCC 


AAC 


GGA 


AGT 


GGA 


TGG 


AGG 


TGG 


AGC 


AGT 




TCC 


AAT 


GTG 


CTG 


ACC 


TTC 


TAT 


AAC 


TGG 


GAG 


AGG 


AAC 


CCC 


TCT 


ATT 


GOT 


GCT 


GAC 


CGT 


GGT 


TAT 


TGT 


GCA 


GTT 


TTG 


TCT 


CAG 


AAA 


TCA 


GGT 


TTT 


CAG 


AAG 


TGG 


AGA 


GAT 


TTT 


AAT 


TGT 


GAA 


AAT 


GAG 


CTT 


CCC 


TAT 


ATC 


TGC 


AAA 


TTC 


AAG 


GTC 


TAG 


GGC 


AGT 


TCT 


AAT 


TTC 


AAC 


AGC 


TTG 


AAA 


ATA 


TTA 


TGA 


AGC 


TCA 


CAT 


GGA 


CAA 


GGA 


AGC 


AAG 


TAT 


GAG 


GAT 


TCA 


CTC 




AGG 


AAG 


AGC 


ACT 


CGA 


GCC 


GGT 


C-3' 













^The bolded areas represent the primers. 

EXAMPLE 8 

This example describes the creation of a plasmid contain- 
ing an expression construct for expression in eukaryotic 30 
systems. 

Restriction enzyme digestion of the INGMAT PCR 
product and the pEBVHis-B vector 

We performed two parallel restriction enzyme digestion 
reactions using 2.5 //g of both the INGMAT PCR product 
and pEB VHis-B vector. INGMAT was digested with Bam 
HI and Xho I simultaneously in a 30 /d volume. pEB VHis-B 
was digested with Bam HI and Xho I simultaneously in a 30 ^ 
fi\ volume. Both digestion reactions were carried out at 37° 
C for a period of 4 hours After the reactions were 
completed, 400 ng of each was electrophoresed on a 1 5% 
agarose gel and stained with ethidium bromide to assure 
complete digestion. The remainder (~2.1 fig) of both diges- 45 
tion reactions were passed over a sepharose G-50 column 
twice to remove the small DNA fragments followed by two 
equal volume phenol extractions. The extracted DNA was 
then precipitated with 2 volumes of ethanol and Vio volume 
3M sodium acetate at -80° C. for 20 minutes, centrifuged, 5Q 
washed twice with 70% ethanol and dried under vacuum. 
The pellets were resuspended in 25 /A Rfl and quantified. 

The pEBVHis-B expression system was purchased from 
INVITROGEN Corp. San Diego, Calif. 

55 

Ligation of INGMAT into pEB VHis-B 

INGMAT (Bam Hl/Xho I) and pEBVHis-B(Bam Hl/Xho 
I) have compatible ends suitable for ligation. 

ligation conditions using a 20:1 insert to vector molar 60 
ratio. 



pEBVHis-B(vector) 62 ng 1 fil 

INGMAT (insert) 80 ng 4 /il 

10 X Ligation buffer 1 fd 

10 mM rATP 1 ft\ 
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-continued 



T4 Ugase 4u 1 /Jtl 

H 2 0 2f& 



final volume - 10 /zl 



The ligation reactions were incubated at 12° C for 16 
35 hours. 

Transformation of the ligation reaction products 
into INVaF' competent E. coli 

We removed 5 fA of the ligation reaction into 100 fA of 
40 competent INVaF cells, (INVaF cells were purchased 
from Invitrogen, San Diego, Calif.) with 0.5 {A of 500 mM 
(3-mercaptoethanoI and incubated on ice for 30 minutes, heat 
shocked for 45 seconds at 42° C, and recovered on ice for 
2 minutes. Then we added 1 ml of prewarmed SOC media 
45 and incubated at 37° C. with shaking at 225 rpm for 1 hour 
followed by plating all the tracsformation reaction on LB 
broth agar plates containing 100 /ig/ml ampicillin. 

Selection of traosfomiants 

50 Six colonies were picked and grown up in LB broth with 
ampicillin. DNA was isolated from the small cultures using 
alkaline lysis extraction procedures, phenol extracted, 
precipitated, dried, and resuspended in 50 pd H 2 0, Small 

55 aliquots of each of the isolated DNA were digested with 
Bam HI and Xho I to release insert. The digested DNA was 
run on 1.5% agarose gel and visualized by staining with 
ethidium bromide. 
Several of the transformants that demonstrated the plas- 

60 mid with inserts of the correct size were stored in 50% 
glycerol at -80° C. Large plasmid DNA stocks were isolated 
from 250 ml LB overnight cultures for use in COS-7 cell 
transfections. 

COS7 transfection 

65 

The eukaryotic cell transfection was carried out according 
to method described by Chen and Okayama "High- 
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Efficiency Transformation of Mammalian Cells by Plasm" I 
DNA", Molecular and Cellular Biology, vol. 7, No 
August 1987, p 2745-2752). 

COS-7 cells (SV40 transformed African green monkcv 
kidney cells) were grown on twenty 150 mm diameter pla 
in culture medium (Dulbecco's modified Eagles medium. 
10% fetal bovine serum, penicillin/streptomycin) to 80'/? 
confmency. 

Each plate was washed twice with 10 mis of PBS, and 2'i 
mis of fresh culture medium added. The DNA transfectum 
mixture (2.5 ml) was added dropwise to each plate, swirled 
gently, and incubated overnight at 37° C. 

DNA transfection mixture: 



pEBVHfs-INGMAT 60 ng 0.080 ml 

H 2 0 1.045 ml 

2.5 M CaCL, 0.125 ml 

2XBES 1.25 ml 

Final volume * 2.5 mi 



The transfection media was removed from the plates The 
plates were then washed 3 times with culture medium, 
replenished with 25 ml of culture media, and incubated feu 
48 hrs. The plates were washed twice with PBS and 
trypsinized. The trypsinized cells were collected from 
groups of 5 plates, pelleted, and frozen with liquid nitrogen 

EXAMPLE 9 

This example describes denaturing metal affinity protein 
chromatography isolation of his tagged-lNGAP protein 
without signal peptide. (Procedure for 2 cell pellets from five 
150 mm plates each of pEBVHis-INGMAT transfected 
COS-7 cells) 

Ni+ 2 NTA agarose preparation 

Place 5 ml of the 50% slurry into a disposable 50 nil 
centrifuge tube. Centrifuge for 2 minutes at 800xg and 
discard the supernatant. Add 42 ml of sterile water, resus- 
pend the resin. Centrifuge for 2 minutes at 800xG and 
discard the supernatant. Add 42 ml of sterile water, resus- 
pend the resin. Centrifuge for 2 minutes at 800xG and 
discard the supernatant. Add 42 ml of binding/lysis buffer A 
(6M Guanidine HQ, 0.1M sodium phosphate, 0.01M Tris, 
pH 8.0) and resuspend the resin. Centrifuge for 2 minutes at 
800xG and discard the supernatant. Add 42 ml of binding 
lysis buffer A (6M Guanidine HCL, 0.1M sodium phospbax, 
0.0 1M Tris, pH 8.0) and resuspend the resin. Centrifuge for 
2 minutes at 800xG and discard the supernatant. Add 42 ml 
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of binding/lysis buffer A (6M Guanidine HCL, 0.1M sodium 
phosphate, 0.0 1M Tris, pH 8.0) and resuspend the resin . 
Centrifuge for 2 minutes at 800xG and discard the super- 
natant. Bring the total volume up to 5 ml with buffer A. The 
5 slurry is now ready for the application of the transfected 
COS-7 cell extract. 

Transfected COS-7 cells lysis and protein isolation 

30 Resuspend the transfected cell pellets in 2.5 ml of lysis 
buffer A. (6M Guanidine HCL, 0.1M sodium phosphate, 
0.01M Tris, pH 8.0). Combine two resuspensions into one 
for a final volume of 5 ml. The lysed cells were passed 
through an 18-gauge needle 4 times, transferred to a 15 ml 

15 centrifuge tube, and centrifuged for 15 minutes at 10,000xG. 
The supernatant was collected and 5 ml of prepared Ni 2+ 
NTA was added. The mixture was rotated for 45 minutes. 
The slurry was loaded onto a 1.6 cm diameter column and 
allowed to flow through by gravity. 

20 

Washes 

Flow through 30 ml of buffer A. (No need to collect.) 
How through 30 ml of buffer B (8M Urea, 0.1M Sodium 

25 phosphate, 0.01M Tris, pH 8.0). ( No need to collect.) but 
AjgQ should be at or near zero before continuing, if not, then 
wash with more. Wash through 20 ml of buffer C, same as 
B but pH 6.3. Collect 3 ml fractions. Wash through 20 ml of 
buffer D, same as B but pH 5.9. Collect 3 ml fractions. 

30 Wash through 20 ml of buffer E, same as B but pH 4.5. 
Collect 3 ml fractions. At this point the protein should be in 
one of the fractions taken. Read the absorbance at 280 of all 
the fractions to discern where the protein is. Fractions 
containing the protein were pooled, concentrated, and ana- 

35 lyzed by Western blot to confirm identity of the protein. 

This example describes an analytical technique confirm- 
ing the identity of the product. 

Biological activity of the expressed protein 

40 

The ability of the expressed protein to stimulate cell 
proliferation was tested on ARIP cells These cells exhibited 
a 50% increase m 3 H-TdR incorporation, at doses of the 
protein of 10-100 ng/ml. 

45 In summary, we have been able to express INGAP protein 
in an eukaryotic system by excluding the 5'UTR and the 
signal peptide. The resultant protein is of the predicted 
molecular size of INGAP monomer and reacts with antibody 
to INGAP in a Western analysis. The protein shares with 
INGAP peptide the ability to induce ductal cell proliferation. 



SEQUENCE LISTING 

( i ) GENERAL INFORMATION: 

( i i i ) NUMBER OF SEQUENCES: 6 

( 2 ) INFORMATION FOR SEQ U) NO:i: 

( i ) SEQUENCE CHARACTERISTICS. 

( A ) LENGTH: 94 base pairs 
( B ) TYPE nucleic acid 
( C ) STRANDEDNESS single 
( D ) TOPOLOGY: linear 
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( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
CTGCAAGACA GGTACCATGA TGCTTCCCAT GACCCTCTGT AGGATGTCTT GGATGCTGCT 
TTCCTGCCTG ATGTTCCTTT CTTGGGTGGA AGGT 



( 2 ) INFORMATION FOR SEQ U) NO:2: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 31 base pairs 
( B ) TYPE: nucleic add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

CCGCGGATCC CGAAGAATCT CAAAAGAAAC T 



( 2 ) INFORMATION FOR SEQ ID NO:3: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

GACCGGC T CG AGTGCTCTTC CTGAGTGAAT CC 



( 2 ) INFORMATION FOR SEQ ID NO:4: 

< i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 558 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( j i ) MOLECULE TYPE: cDNA 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:4: 



CCGCGGATCC 


CGAAGAATCT 


CAAAAGAAAC 


TGCC TTCTTC 


ACGTATAAC C 


TGT CCTC AAG 


6 0 


GCTCTGTAGC 


CT ATGGGTCC 


TATTGCTATT 


C ACTGATTTT 


GAT AC CACAG 


ACCTGGTCTA 


1 2 0 


ATGC AGAACT 


ATCCTGCCAG 


AT GCATTTCT 


C AGGAC AC CT 


GGCATTTCTT 


CTCAGT ACTG 


1 8 0 


GTGAAATTAC 


CT TCGTGT CC 


TCCCTTGTGA 


AGAAC AGTTT 


GACGGCCTAC 


CAGTACATCT 


2 4 0 


GGATTGGACT 


CCATGATCCC 


TCACATGGT A 


C ACT ACCCAA 


CGGAAGTGGA 


TGGAAGTGGA 


3 0 0 


GCAGTTCCAA 


TGTGCTGACC 


TTCT ATAACT 


GGG AG AGG A A 


CCCCTCTATT 


GCTGCTGACC 


3 6 0 


GTGGTTATTG 


TGCAGTTTTG 


T CT CAGAAAT 


CAGGTTTTCA 


GAAGTGGAGA 


GATTTTAATT 


4 2 0 


GTGAAAATGA 


GCTTCCCT AT 


ATCTGCAAAT 


TCAAGGTCTA 


GGGCAGTTCT 


AATTTCAACA 


4 8 0 


GCTTG AAAAT 


ATTATGAAGC 


TCACATGGAC 


AAGGAAGCAA 


GT ATGAGGAT 


TCACTCAGGA 


5 4 0 


AGAGCACTCG 


AGCCGGTC 










5 5 8 



( 2 ) INFORMATION FOR SEQ ID NOA 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 26 amino acids 
( B ) TYPE: amino acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( j i ) MOLECULE TYPE: peptide 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

Mel Met Leu Pro Met Thr Leu Cys Arg Met Ser Trp Met Leu 
1 5 10 
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Cys Leu Met Pht Lti Ser Tip Val Glu Gly 

2 0 2 5 



( 2 ) INFORMATION FOR SEQ ID NO:6; 



( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 175 *nino acids 
( B ) TYPE: amino arid 
( C ) STRANDEDNESSc single 
( D ) TOPOLOGY; Smar 

( i i ) MOLECULE TYPE- prole* 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO;6: 

Met Met Leu Pio Met Thr Lett Cys Arg Met Ser Trp Mel Leo Leu Ser 
1 S 1 0 15 

Cys Leu Met Phc Let Ser Trp Val Glu Gly Gin Glu Ser Gin Lys Lys 

2 0 2 5 3 0 

Leu Pro Scr Ser At{ lie Thr Cys Pro Gin Gly Ser Val Ala Tyr Gly 
3 5 4 0 4 5 

Ser Tyr Cyi Tyr Ser Lee lie Leu Me Pro Gl* Tbi Trp Ser Asa At* 

5 0 5 5 6 0 

Glu Leu Ser Cys Gl m Mel His Phc Ser Gly His Leu Ala Pbe Leu Lea 

65 70 75 80 

Ser Thr Gly Glu lie Thr Phe Val Scr Ser Leu Val Lys Ass Ser Leu 
8 5 9 0 9 5 

Thr Ala Tyr Gin Tyi Me Trp Me Gly Leu His Asp Pro Ser His Gly 
100 LOS 110 

Thr Leu Pro Asn Gly Scr Gly Trp Lys Tip Ser Scr Ser Asn Val Leu 
L 1 5 12 0 12 5 

Thr Phc Tyr Asa Trp Gin Arg Asn Pro Ser Me Ala Ala Asp Arg Gly 

13 0 13 5 14 0 

Tyr Cys Ala Val Leu Ser Gin Lys Scr Gly Phc Gin Lys Trp Arg Asp 
14S 150 155 160 

Phc Asn Cys Gl a A * n Glu Leu Pro Tyr Me Cys Lys Phc Lys Val 



We claim. 

1. A recombinant construct for expression of Islet Neo- 45 ■ 
genesis Associated Protein or INGAP activity comprising: 

a first nucleotide sequence encoding amino acids 27 to 
175 as shown in SEQ ID NO: 6 operably linked to a 
transcriptional initiation site and a translation^ initia- i 
tion site, wherein a second nucleotide sequence encod- so 
ing a signal peptide is not present immediately 5' of i 
said first nucleotide sequence. 

2. The construct of claim 1 wherein nucleotides 1-16 of » 
SEQ ID NO: 1 are not present 5' of said first nucleotide 
sequence. 

3. The construct of claim 1 further comprising a third 55 
nucleotide sequence encoding a histidine tag. 

4 The construct of claim 3 wherein the third nucleotide 
sequence is immediately 5 1 or J to said first nucleotide 

sequence, 

5. The construct of claim 1 wherein the transcriptional 60 
initiation site is inducible. 

6. The construct of claim 1 wherein the transcriptional 
initiation site is the lac promoter/operator. 

7. The construct of clajrnlfe rther comprising, a promoter 
sequenc^ 

8. The construct of claim 7 wherein the [promoter 65 
sequences Rous sarcoma virus long terminal repeal 
(RSVLTRT — " 
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9. The construct of claim 1 further comprising a nucle- 
45 otide sequence encoding a nuclear antigen. 

10. The construct of claim 9 wherein the nuclear antigen 
is EBNA-1. 

11. The construct of claim 1 further comprising an origin 
of replication. t m 

50 12. The construct of claim 11 wherein the origin of 
replication is Epstein Bar Virus (EBV) origin of replication. 

13. A method of producing biologically active Islet Neo- 
genesis Associated Protein or INGAP protein from a recom- 
binant host cell comprising the steps of: 

culturing a host cell comprising a recombinant construct 
55 comprising a first nucleotide sequence encoding amino 
acids 27 to 175 as shown in SEQ ID NO: 6 operably 
linked to a transriptional initiation site and a transna- 
tional initiation site, wherein a second nucleotide 
sequence encoding a signal peptide is not present 
60 immediately 5 1 of said first nucleotide sequence, and 
recovering protein from said cultured host celL 

14. The method of claim 13 wherein the construct further 
comprises a third nucleotide sequence encoding a histidipe 
tag, and INGAP protein is purified using a nickel affinity 

65 matrix. 

15. A host cell comprising a recombinant construct com- 
prising a first nucleotide sequence encoding amino acids 27 
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to 175 as shown in SEP ID NO: 6 operaMv linked to a ini-KcA~*0^> 

transcriptional [Iron^site and a translationa! initiation site, — -* 

wherein a second nucleotide sequence^er^ojin^a^sign^l — — 
peptide is not present immediate!;^ *j|pf saidfirst nucleotide 
sequence, 5 

16. The construct of claim 1 wherein the first nucleotide 
sequence encoding amino acids 27 to 175 comprises nucle- 
otides 12-456 of SEO ID NO: 4. 
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17. The method of claim 13 wherein the first nucleotide 
sequence encoding amino acids 27-175 comprises nucle- 
otides 12-456 of SEQ ID NO: 4. 

18. The host cell of claim 15 wherein the first nucleotide 
sequence encoding amino acids 27-175 comprises nucle- 
otides 12-456 of SEQ ID NO: 4. 



* * * * * 
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19. The construct of claim 1 wherein the 
t r a n scri ptiona l init ia ti o n site is s elected from the 
group consisting of: Acl promoter, tac promoter, trp 
pro m oter, a n d tet p ro m ote r, 

20. The construct of claim 1 which comprises a 
n u cleo ti de se q uen ce as shown in SEQ ID NO; 4, 

2 1 . A p air of oligonucleo ti de pri mers for amplifying 
a portion of the human INGAP coding sequence, 
wherein said portion excludes the nucleotides 
encoding the signal peptide , wherein each of said 
oligonucle otide primers hybridizes to an opposite 
strand of a d o uble - strand ed INGA P template, wherein 
a first of said oligonucleotide primers hybridizes to 
the 5 } end of th e coding seq u e n ce, for matu re h uman 
INGAP an d the se cond of sa id oligonucleotide 
primers hybridizes to the 3' end of the nucleotide 
sequence e ncod i ng mature human. I NG AP, 

2 2. Th e p air of oligonu cl eotide p rim ers of claim 21 
wherein o ne primer has the nucleotide sequence 
s h own i n SEQ I D N O; 2 and on e primer has the 
nucleo tid e se q ue nce s hown i n S EQ ID NO; 3, 

23. A method of forming an expression construct for 
produ cing ING A P i n a re combinant host cell 
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comprising the step of; 

lin k ing a t ranscription i n itiation si te, a 
translation initiation site, and a coding sequence for 
mature .human INGAP, to for m a n ex pression 
const ruct which is devoid of the signal sequence of 
the coding sequence o f INGAP. 

24. The method of claim 23 further comprising 
linking to said coding se quence for mature human 
I NGAP a cod i ng seq u ence for a histidine t ag , 

2 5 , The method of claim 23 wh erein the tr anscription 
initiation site is inducible, 

26, The m etho d of cl a im 25 wher ein the tra n s cription 
initiation site is selected from the g ro u p consisting of 
the lac promoter/o perator, the tac promoter, the trp 
promoter, the Acl promoter, and the let promoter. 

27. The method of claim 23 wherein the coding 
se q u en ce for m at ur e hum an I NG AP is ob tained by 
amplific ati on of a p ortion, of the h um an I NG AP 
coding sequ ence , whe rein said portion excludes the 
nucleotide s en coding the signal p eptide, 



28. The method of claim 27 wherein the 



25 



amplificati on is performed using p rimers having 
se quences as shown in SEQ TP NO: 2 and SEQ TP 
NO; 3. 

29. A r ecomb i na nt co n struct fo r expressio n o f Islet 
Ne o g enesis A ssociated Protein (ING AP) a cti vi ty 
comprising; . 

a first nucleotide sequence en coding matu r e 
hu man ING AP, said fi rst n ucleo t ide sequence bein g 
operably linked to a transcriptional i nitiation site a nd 
a translational initiation site, wherein a second 
nucleo ti de se quence encoding a signal peptide 
according to SEQ IP NO;. 5 is not p resent 
immediat ely 5' o f said first nucleotide sequence, 

30. The construct of c laim 29 wherein nucleotides 1- 
16 of SEQ I P NO; 1 a re not p resent 5' of said first 
nucleotide sequence, 

31. The construct of claim 29 further comprising a 
third nucleotide sequence en co di ng ahistidine tag, .. 



3 2 , T he co ns tr uct of cla i m 29 wherein the third 
nucl eotide sequence is immedia tely 5 ' or 3' to said 
first nucleoti de se q uence, 

33. The construct of claim 29 wherein the 
transcriptional i nitiation s ite is inducible, 
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34. The construct o f claim 33 wherein the 
Iran scripti on a 1 i ni ti ati on site is the kc 

promoter/operator, 

35. The construct of claim 29 wherein the 
transcri ptional initiation site is capable of initiatillR 
constitutive transcription. 

36. The construct, of cla im 35 wherein the promoter 
sequence is Rous sarcoma virus long terminal repe at 
(R SVLTR ), 

37. The construct of claim 29 further comp rising a 
nucleotide se quence en coding a nuclear antigen, 

3R. The construct of claim 37 wherein the nuclear 

a ntigen is E BNA- 1, 

39. The construct of claim 29 further comprising an 
origin of replication. 

40. The construct of cl aim 39 wherein the origin of 
re plication is Epstein Bar Vims TEBV) o rigin of 
replication. 

41. The construct of claim 33 wherein the 

transcriptional initiation site is the Acl 

prom ote r /operator, 

42. The construct o f claim 33 wherein the 
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transcriptional initiatio n s i te is t he trp promoter. 

43. The construct of claim 33 wherein the 
transcriptional initiation site is the tac promoter, 

44. The construct of claim 33 wherein the 
tr a nscrip ti o n al i nitiat i o n s ite is t he t et p romoter, 

45. A method of producing biolo gically active Islet 
Neogenesis Associated Pr otein ( I NGA F) pr otein from 
a recombinant host cel l com pris i ng th e st eps of ; 

culturin g a host cell comp rising a recombinant 
co n s tr uct comprising a f irst n ucleo ti de se quence 
encodin g m ature huma n ING AP o p erably linked to a 
transcriptional initiation site and a translational 
ini ti atio n s it e, whe r ein a.second nu cleo ti de sequence 
e nc o d i ng a si gn al p e p tide according to SEQ ID NO;. 
5 is not present immediately 5' of said first nucleotide 
se q uence; a nd 

rec ove ri n g p ro te in from said cultured host cell 

46. T h e met h od o f claim 45 w he rein the construct 
f urt he r co m prises a thir d n uc le otide se quence 
encoding a histidine tag, and INGAP protein is 
purified. using a nickel a ffinity m a trix, 

47. A host cell comprising a re combinant construct 
co mpri si n g a first nucleo ti de se quence e ncoding 
mature h um an ING AP op erably l i nked to a 
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transcriptional initiation site and a translational 
initiation site, wherein a second nucleotide sequence 
encoding a signal peptide according to SEQ ID NO: 
5 is not present immediately 5 1 of said first nucleotide 
sequence, 
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[57] ABSTRACT 

Removal of the nucleotide sequence encoding the signal 
peptide from the INGAP coding sequence allows cultured 
cells to express substantial amounts of INGAP activity. 
Previous attempts have provided only low yields of INGAP, 
possibly because the signal sequence of INGAP is toxic to 
the cells 
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the specification of which 
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was filed on as Application Serial Number and was amended on 

(if applicable). 
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J37, Code of Federal Regulations, § 1.56(a). 
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(1) We believe that the original patent which issued on this application, U.S. 5,804,421, is wholly or 
partially inoperative or invalid by reason of the patentee claiming less than it had the right to claim and because of 
defects in the specification as detailed below: 

• Dependent claims 7 and 8 improperly refer to an additional element (a promoter sequence) which is 
in actuality already recited in independent claim 1 (as a transcriptional initiation site); therefore the 
promoter sequence is not an aditional element; 

• Applicants claimed less than they were entitled to claim in failing to claim oligonucleotide primers 
for amplifying the mature INGAP coding sequence; 

• Applicants claimed less than they were entitled to claim in failing to claim a method of forming an 
expression construct for producing INGAP. 

(2) All errors which are being corrected in the present reissue application up to the time of filing of this 
declaration arose without any deceptive intent on the part of the applicants. 

' (3) We hereby declare that all statements made herein of our own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these statements were made with 
Tthe knowledge that willful false statements and the like so made are punishable by fine or imprisonment, or both, 
junder Section 1001 of Title 18 of the United States Code and that such willful false statements may jeopardize the 
ryalidity of the application or any patent issuing thereon. 

i\ And we hereby appoint, both jointly and severally, as our attorneys with full power of substitution and 
Revocation, to prosecute this application and to transact all business in the Patent and Trademark Office connected 
therewith the following attorneys who are all members of the Bar of the District of Columbia, their registration 
numbers being listed after their names: 

% Donald W. Banner, Registration No. 17,037; Edward F. McKie, Jr., Registration No. 17,335; William W. 
"Beckett, Registration No. 18,262; Dale H. Hoscheit, Registration No. 19,090; Joseph M. Potenza, Registration No, 
-28,175; James A. Niegowski, Registration No. 28,331; Joseph M. Skerpon, Registration No. 29,864; Thomas L. 
^Peterson, Registration No. 30,969; Nina L. Medlock, Registration No. 29,673; William J. Fisher, Registration 
"No. 32,133; Thomas H. Jackson, Registration No. 29,808; Sarah A. Kagan, Registration No. 32,141, Patricia E. 

Hong, Registration No. 34,373; Robert S. Katz, Registration No. 36,402, Brian E. Hanlon, Registration No. 40,449, 

and Lisa M, Hemmendinger, Registration No. 42,653. 

All correspondence and telephone communications should be addressed to: Banner & Witcoff, Ltd., Eleventh 
Floor, 1001 G Street, N.W., Washington, D.C. 20001-4597, telephone number (202) 508-9100, which is also the 
address and telephone number of each of the above listed attorneys. 
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